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METHODS FOR DETERMINING COAT COLOUR GENOTYPES IN PIGS 

/ The present invention relates to methods for determining coat colour genotype 
/ in pigs. In particular, it relates to methods of distinguishing between the alleles 
/, F, f and / of the KIT gene. 

Coat colour is important to the pig breeding industry for a number of reasons. 
Firstly in a number of markets there is a preference for white skinned meat 
This is due to the fact that pork is often marketed with the skin still attached, 
and skins from coloured pigs, even if dehaired, can still exhibit coloured hair 
roots, which can lead to negative perception by the consumer, since the surface 
of the meat may appear to be spotted by mould. It is therefore necessary in 
these markets to remove the skin from such carcasses, entailing additional cost 

For example, in the US, coloured carcasses are associated with approximately 
1% of skin defects requiring dehairing and skinning to remove pigment. As a 
result of this, coloured pig carcasses are generally discounted. Secondly gross 
variation in the appearance of pigs claimed to be genetically consistent for other 
traits can lead to questions about the consistency and quality of the animals in 
the mind of pig-producing customers. Breeders would also like to be able to 
ensure consistency in breeding populations. Thus breeders may wish to ensure 
that progeny produced by breeding crosses were always white. Alternatively a 
breeder producing a coloured breed may wish to ensure that the correct coat 
colour characteristics were maintained even during the introgression of genes 
from white lines. 
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White coat colour in pigs is controlled by the dominant white locus designated / 
(for inhibition of coat colour). Structural alterations in the porcine KIT gene 
have recently been correlated with various alleles of / and are probably 
responsible for the differences in coat colour pattern found or are closely linked 
5 to the mutations that are found (Johansson Moller et aL 1996 Mammalian 
Genome 7, 822-830). Four structural versions of the porcine KIT gene have 
been identified to date and designated I, F, I* and i (see figure 1 and table 1). 
The version found in fully coloured animals including wild boar and which is 
therefore generally accepted as the wild type allele is /. The other versions of 
10 the gene known all involve duplication of at least part of the porcine KIT gone. 
F is a partially dominant allele and causes in the heterozygous state (F/i) a 
phenotype "patch" characterised by patches of white and coloured coat. / and /* 
are both fully dominant and cause a white phenotype both in the heterozygous 
and homozygous state. The only difference between / and /* is that there is a 
15 4bp deletion in intron 18 in one of the two KIT gene copies associated with the 
/ allele. This sequence polymorphism is not expected to have any functional 
effect. 

The phenotypes of a number of gene combinations are listed below with, the / 
20 allele being dominant over F and /, and F being dominant over i (Johansson et 
aL 1992 Genomics 14, 965-969) 



Genotype Colour 

/// White 

I/F White 

25 Hi White 



• 



F/i 



Patched 



Hi 



Coloured 



/*// 



White 



5 Previously Moller Johansson et aL (1996 Mammalian Genome 7, 822-830) 

revealed that KIT occurs as a single copy gene in coloured (i/i) animals but is 
duplicated in the /, f and F allele. This duplication allowed the differentiation 
of/, /* and F from i through examination of the gene copy number of KIT and 
to use linked polymorphisms in or in the near vicinity of the KIT gene to 
10 distinguish different alleles at the I locus. This approach is the subject of 
International patent WO97/05278. 

A problem remaining from this method however is that it does not distinguish 
between /* and F. The result of this is that if one wishes to use the screen to 

15 remove heterozygous carrier animals of genotype I/F from a white pig line one 
also excludes I/I* animals unnecessarily. The removal of animals, potentially 
very valuable if at the top of a breeding pyramid, unnecessarily can lead to a 
reduction in the rate of improvement of other traits within that particular group 
of animals. Other consequences include a general loss of genetic diversity and a 

20 loss of alleles at the locus in question that may have as yet undetected value. It 
is essential that one only excludes alleles where absolutely necessary. 

Further sequence analysis of genomic DNA revealed a mutation at the first 
nucleotide in intron 17 of KIT2 leading to a defect in the splicing of RNA 
25 transcribed from this particular copy of the KIT gene. In experiments involving 
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several pig breeds, this mutation showed a complete concordance with the 
presence of the / and /* alleles but was not found in the i and F alleles 

The majority of eukaiyotic genes are composed of both coding (exon) and 
5 intervening noncoding (intron) regions. The latter of these sequences are 
removed from large pre-mRNA by a highly accurate cleavage and ligation 
reaction known as splicing. The result is a mature mRNA transcript, devoid of 
intron sequences, which is transported to the cytoplasm for translation. The 
splicing of eukaryotic genes is most likely a two step procedure. First, the pre- 

10 mRNA is cleaved at the 5' (donor) splice site followed by cleavage at the 3* 
(acceptor) splice site. The second step involves rejoining of the spliced exons to 
result in exclusion of the intervening intron. Splicing is therefore critically 
dependant on the accuracy of the cleavage and ligation reactions. This accuracy 
appears to be dependant on the almost completely invariant GT and AG 

15 dinucleotides present at the 5* and 3' exon/intron boundaries respectively. 
These dinucleotides and the often highly conserved surrounding sequences are 
known as splice sites and serve to bind the protein factors required to perform 
the cleavage and ligation reactions. 

20 The consequences for mutation occurring within a splice site may be a 
reduction in the amount of mature rnRNA produced and/or utilization of 
alternative but incon-ect splice sites in the vicinity. The result is production of 
mRNA which either contains additional intron sequence or which may lack a 
portion of coding sequence. Where mutation occurs within the 5* (donor) splice 

25 site and prevents binding of protein factors, the exon is no longer recognised as 
such and is excised along with its neighbouring introns. This is referred to as 
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'exon skipping 5 (as reviewed by Cooper and Rrawczak, 1994 Human Gene 
Mutation. Bios Scientific Publishers, Oxford, UK, 1994), 

In the mutation identified here the G of the conserved GT pair at the 
5 exonl7/intronl7 boundary region is altered to an A. That there is an alteration 
in the messenger RNA that is translated into protein and that the presence of the 
change correlates with the white/patched phenotypes suggests that this is the 
functional mutation rather than merely a linked marker. The splice variant is 
expected to give rise to a defective protein as 41 amino acids in the mature 

10 protein are missing. We therefore assume that this splice mutation is the causal 
mutation for the difference between / and f. Our current knowledge as regards 
molecular differences between alleles at the / locus is summarised in Table 1. In 
conclusion, we have identified two functionally important mutations. One is the 
gene duplication present in F, /* and / which by itself appears to cause the 

15 patch phenotype. The exact reason for this phenotypic effect is not known but 
one can speculate on the basis of comparative data from the mouse that the 
duplicated copy of the KIT gene may lead to a defect in gene expression which 
in turn affects melanocyte migration. This is especially valuable in that there 
can be no breakdown of the linkage between the DNA polymorphism and the 

20 trait itself. This has allowed us to develop a range of assays for the 
determination of the presence of the DNA polymorphism and the genotype of 
the animal with regard to coat colour determination to a significantly greater 
extent than has been possible before. The second mutation is the splice mutation 
that occurs in one of the KIT copies associated with the dominant white (/ and 

25 /*) alleles. The expression of a truncated form of the KIT protein is expected to 
cause a more severe defect in KIT function and a more severe effect on coat 
colour* 
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Table 1. 
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Allele 


KIT gene 


Intron 17 


Associated Phenotype 


i 


KIT1 


Normal 


Coloured 


F 


KIT1 


Nonnal 


Patch 




KTT2 


Normal 




I 


KIT1 


Normal 


Dominant White 




KIT2 


Mutated 




i* 


KIT1 


Normal 


Dominant White 




KIT2 


Mutated 





In addition to these two functionally important mutations, a mutation in the KIT 
20 gene with no known phenotypic effect, the 4 bp deletion in intron 18 has been 
documented at the / locus as described in International patent application No/ 
PCT/GB96/01794. 

Thus, in a first aspect the present invention provides a method for determining 
coat colour genotype in a pig which comprises: 

25 (a) obtaining a sample of pig nucleic acid; and 

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation 
is/is not present at one or more exon/intron splice sites of the KIT gene. 

In particular, the method determines whether a mutation is/is not present at the 
30 exon 17/intron 17 boundary, eg the substitution of the G of the conserved GT 
pair for A. 
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Reverse Transcriptase based Polymerase Chain analysis (RT-PCR) of the exon 
16-19 region of KIT mRNA in animals of the Large White breed (III) and 
comparison to that transcribed in Hampshire animals (i/i) revealed an extra 
species of molecule in the former animals. RT-PCR analysis of both breeds 
5 yielded a product fragment with a length of 424bp indicating that both types of 
animal contained a Js3TmRNA transcript containing a region corresponding to 
this region of the gene. However in addition the /// animals yielded a RT-PCR 
product of 301bp indicating the presence of an mRNA species which did not 
contain the full transcription of the exon 16-19 DNA sequence (see example 1). 
10 These two transcripts have been shown to be derived from the separate 
duplicate copies of the KIT gene associated with the / allele. Sequencing over 
the KIT exon 17/intron 17 boundary revealed a difference in the intron 
boundary sequences present in the two duplicate copies of the KIT gene 
associated with the / allele (see example 2). The sequences are as shown 



15 below: 

Allele Kit Gene Exon 17 Intron 17 

i KIT1 ....AAC GTG 

F KIT1 AAC GTG 

/ KIT2 AAC GTG 

20 J KIT1 ....AAC GTG 

/ KIT2 AAC ATG 



This alteration of the 5' intron splice site from a GT pair to an AT pair affects 
the splicing of the pre mRNA and results in the loss of the whole of exon 17 
from the mRNA transcribed from the 7-KIT2 sequence. This will result in a 
25 modified KIT protein with the associated alterations in function and phenotype. 
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Based upon the sequence polymorphism rapid tests can be developed to 
determine the alleles carried by a specific animal at the dominant white locus. 

5 In one form such a test would comprise amplification of the region through the 
polymerase chain reaction (PCR) utilising genomic DNA from the animal in 
question as a template. The nucleotide sequence CATG comprises a recognition 
sequence for the restriction enzyme NlaUL. This sequence is only present in the 
I-KIT 2 sequence at the junction position and thus one can differentiate the 
10 DNA molecules amplified from the two alleles by digestion of the amplification 
products with this enzyme or any other restriction enzyme with a suitable 
recognition site. 

Genomic DNA for use in such a test can be prepared by a wide range of 
15 available methods, from any tissues or products derived from the animal in 
question. A 175bp fragment of the KIT gene containing the exon 17/intron 17 
boundary region can be amplified from porcine genomic DNA using a pair of 
primers such as: 

20 KIT21 (5*-GTA TTC ACA GAG ACT TGG CGG C-3'); and 

KIT35 (5 s - AAA CCTGCA AGG AAA ATC CTT CAC GG-3'). 

The use of such primers in a PCR-RFLP test yields a fragment of 175bp before 
digestion with NIalll. In alleles with the G present at nucleotide position 1 of 
25 intron 17 (I-KITl type sequence) there is only one NIalll cleavage site present 



41bp from the end of the fragment. This site is present in all versions of the 
KIT sequence. Thus digestion of the 175bp fragments obtained from i and F 
alleles yields two fragments of 134bp and 41 bp. Where the G is mutated to A 
as in the I-KITl sequence present in / and /* a further Nlalll cleavage site is 
created and thus digestion yields products of 80bp and 54bp and 41bp (see 
figure 3, example 2). A number of other oligonucleotides suitable as PGR 
primeis could easily be derived from the sequence of this region of the porcine 
genome. An example of such a PCR-RFLP test is given in example 3. The 
results that would be obtained from such a PCR-RFLP test described above are 
as shown below: 



Genotype 


Fragment sizes 


Fragment sizes 




134 + 41 bp 


80 + 54 + 41 bp 


III 


Yes 


Yes 


I/f 


Yes 


Yes 


m 


Yes 


Yes 




Yes 


No 


f/i 


Yes 


No 


Hi 


Yes 


No 


1*11 


Yes 


Yes 


1*11* 


Yes 


Yes 


ill* 


Yes 


Yes 


l*/f 


Yes 


Yes 



Thus, simply by analysing restriction products certain genotypes can be 
distinguished. However, there are others which cannot be distinguished in this 
first configuration of the test In a further refinement the test can be carried out 
in such a way that the amount of each fragment can be calculated. By carrying 
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out electrophoresis on an apparatus that allows the quantification of each of the 
bands one can determine the ratio of the two forms of template in the genomic 
DNA sample used. Examples of such an apparatus include the Perkin Elmer 
Applied Biosystems 373 and 377 DNA sequencing systems. The application of 
this type of equipment is illustrated in example 4. Any equipment capable of 
determining the relative amounts of the products from the two different 
sequences is equally applicable to such tests. The expected results from such a 
test are shown below. 



Genotype 
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/// 
Iff 
III* 

in 

F/F 
f/I* 
F/i 

i*n 
1*11* 

i/i 



Normal KIT 
sequence 
Copies 
2 
3 
2 
2 
4 
3 
3 
2 
2 
2 



Splice mutant KIT 
Copies 

2 
1 
2 
1 
0 
1 
0 
1 
2 
0 



Ratio Normal 
Splice mutant 



1 
3 
1 
2 
0 
3 
0 
2 
1 
0 



Thus, using such a test one could identify all animals carrying alleles of 
dominant white that might dispose themselves or their offspring to exhibiting 
non- white coat colour (i orF) as those giving a ratio other than one. Depending 
on the requirement and the derivation of the lines under selection one could take 
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an appropriate subset of animals. For example in a cross derived animals 
carrying only alleles I and i one could identify any white individuals (/// or III) 
carrying i as they would have a ratio in the test of 2 as opposed to 1 for the /// 
animals. The distinction of alleles 7 and 7* can be carried out on the basis of the 
5 4bp deletion as described in the previously filed patent publication 
WO97/05278. 

There are a range of techniques by which differentiation of alleles containing 
the splice mutation and those containing the normal sequence could be 
1 0 differentiated by a person expert in the field. 

Analysis of the genetic composition of an animal could be based upon a number 
of different source materials. These include genomic DNA, RNA and the KIT 
protein itself. There may also be effects on the levels and nature of other 
15 proteins, metabolites and RNA species which could be measured to create a 
more indirect assay. 

DNA could be used as the basis for a number of approaches to testing. One 
approach is through the amplification of the region of DNA containing the 

20 polymorphism using the polymerase chain reaction. This could then be linked to 
a number of forms of analysis of the product. Examples of electrophoresis 
based technologies include Single Strand Conformation Polymorphism (SSCP), 
Restriction Fragment Length Polymorphism (RFLP) and DNA sequencing of 
PCR products or direct genome sequencing. Other PCR based techniques that 

25 might alternatively be used include the Perkin Elmer TaqMan systems, Single 
Nucleotide Polymorphic Extension (SNuPE) and Minisequencing. PCR 
products from the region might also have application in hybridization based 
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approaches to the differentiation of alleles at this locus. Hybridization methods 
might include the probing of Southern transfers of genomic DNA with allele 
specific oligonucleotide, RNA, DNA fragment or Protein Nucleic Acid (PNA) 
probes. Other strategies of application here include hybridization of genomic 
5 DNA or PCR products (specific or whole genome) to oligonucleotide arrays 
possibly in the form of 'DNA chips'. Such arrays could consist of any reagent 
capable of binding DNA or RNA derived from the locus in question in an allele 
specific manner. Further useful methods of analysis also include 
oligonucleotide ligation assay and the Iigase chain reaction. For a review on 
10 methods for detecting point mutations see Landegren, 1996, Laboratory 
Protocols for Mutation detection, Oxford University press, Oxford. 

A number of effects on the RNA produced from the gene in question have 
already and may in the future be observed. All such differences between the 

15 mutated and normal forms of the gene are useful targets for the determination of 
genotype and a large range of methods are available to the person skilled in the 
art The changes that are or might be observed and methods of analysis are as 
follows. Alteration of the size, rate of processing, stability and quantity of RNA 
transcripts could be measured through widely used techniques such as northern 

20 blotting and RT-PCR as well as a number of the techniques described above for 
DNA analysis such as hybridization to oligonucleotide or DNA fragment 
arrays. 

Another approach which can be used is to use a linked genetic polymorphism 
25 which is closely associated with the presence or absence of the alteration at the 
exon/intron boundary. Such a polymorphism may occur in the KIT gone itself or 
in a chromosomal region linked to KIT. By using a single linked marker in 
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complete association with the presence/absence of the duplication or a 
combination of markers showing a partial association a highly informative test can 
be developed. For instance, the SSCP (Single Strand Conformation 
Polymorphism) method may be used to develop such polymorphism. The 
5 principle of the method is that double-stranded DNA, produced by PCR, is 
denatured into single-stranded DNA which is then separated by non-dmaturating 
gel electrophoresis. Under non-denaturating conditions the single-stranded DNA 
forms a secondary structure due to intra-strand interaction but a proportion of the 
single-stranded DNA will renature and form double-stranded DNA. Two types of 
10 polymorphism may be revealed by this method. Firstly, a difference in nucleotide 
sequence between two alleles may influence the secondary structure of 
single-stranded DNA which is revealed as a difference in the mobility rate during 
electrophoresis. Secondly, a difference in nucleotide sequence often influences 
the mobility of the heteroduplex DNA (A heteroduplex is a double-stranded DNA 
1 5 molecule formed by two single-stranded molecules representing different alleles). 

Association between genetic markers and genes responsible for a particular trait 
can be disrupted by genetic recombination. Thus, the closer the physical distance 
between the marker and the gene in question, the less likely it is that 
20 recombination will separate them. 

It is also possible to establish linkage between specific alleles of alternative DNA 
markers and alleles of DNA markers known to be associated with a particular 
gene (e.g. the KIT gene discussed herein), which have previously been shown to 
25 be associated with a particular trait Thus, in the present situation, taking the KIT 
gene, it would be possible, at least in the short term, to select for pigs with a 
particular coat colour, indirectly, by selecting for certain alleles of a KIT gene 
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associated marker through the selection of specific alleles of alternative 
chromosome 8 markers. Examples of such markers known to be linked to the KIT 
gene on porcine chromoso 

itself or in the closely linked genes for the a-subunit of platelet derived growth 
5 factor (PDGFRA) and albumin. 

Particular genetic markers associated with the KIT gene are microsatellites. These 
are simple sequence repeats of 4, 3 or, more usually, 2 nucleotides, which occur 
essentially at random around the genome at approximately every 50,000 bases 
10 (about 60,000 microsatellites per haploid genome). Stuttering of DNA 
polymerase during replication and unequal crossing-over during recombination 
are thought to result in the loss or gain of repeat units. This means that 
microsatellites are usually polymorphic and can have several repeat length alleles. 

15 Examples of linked microsatellite sequences include S0086 (Ellegren et al, 
Genomics, 16:431-439 (1993)) , S0017 (Coppieters et al 9 Animal Genetics 24: 
163-170 (1993)), Sw527, Swr750 and SW916 (Rhorer et a!, Genetics, 136:231- 
245 (1994)) . It would be possible to select indirectly for alleles of the KIT gene 
linked to coat colour using any of the above markers, or indeed any other linked 

20 markers on porcine chromosome 8. 

Alterations in the level of the KIT protein could be measured using either 
specific antibodies for example in an ELIS A system or on western blots or 
through the use of a range of biochemical techniques to measure the activity of 
25 the protein. Such tests could also be applied to other proteins or metabolites, 
the level or nature of which is altered by the presence of specific alleles at the 
locus. Different protein structures due to the presence of specific alleles could 
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be identified through the use of structure specific antibodies. As with the DNA 
and RNA based methods all these protein methods could be applied in a 
quantitative fashion mus achte^ 

Kits could be produced for the specific analysis of the polymorphism described 
here alone and also with reagents allowing the combined analysis of the other 
polymorphisms previously reported and the subject of patent publication 
WO97/05278. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The 
examples refer to the figures in which: 

Figure 1: is a schematic representation of the structure of the known 
porcine JOT alleles, where 4bp del'n refers to the 4bp deletion in intron 
18 of one copy of the duplicated KIT gene DNA as reported by Moller et 
ah 1996 and Exon 17-A refers to the change of nucleotide 1 of intron 17 
from a G as in the wild type allele to an A as reported in this patent; 

Figure 2: shows an electropherogram (4% agarose Nusieve/Seakem 3:1; 
100V for 80 min) showing RT-PCR products of ATT exon 16-19 with the 
primers KIT IF and KIT7R. The samples 1-3 and 4-6 are Swedish Large 
White and Hampshire pigs, respectively. The size difference between 
the 424 and 301 bp fragments is due to lack of exon 17 in the latter 
fraction. The two upper bands of the Yorkshire pigs were interpreted as 
heteroduplexes (HD); 
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Figure 3: shows a 48 bp sequence comprising 21 bp of KIT exon 17 and 
27 bp of KIT intron 17 where the position of the intron/exon border is 
marked with a vertical line , the splice site mutation (ntl G ^ A ) indicated 
with a vertical arrow and identical bases in alleles F and i are marked 
with a dot; 

Figure 4: shows the results of NlaUl PCR RFLP test used to detect the 
presence of a splice site mutation in intron 17 of the KIT gene. Figure 
4A shows the position of two NlaUl recognition sites within the PGR 
product amplified using primer pair KIT21 and KIT35. All distances are 
given in base pairs. Figure 4B shows the size of fragments which result 
following NlaUl digestion of either normal KIT or splice mutant KIT. 
Figure 4C illustrates use of the PCR RFLP test Lane 1 shows the 
KIT21/KIT35 amplified fragment undigested. Digestion was performed 
on PCR products amplified from, in Lane 2: a clone which contains the 
splice site mutation; Lane 3: a clone which contains the normal splice 
site sequence; Lane 4; genomic DNA from a coloured pig; Lane 5: 
genomic DNA from a white pig. Fragment sizes are given in base pairs; 

Figure 5: shows a comparison of the ratio of normal to splice mutant 
KIT in animals of genotypes I/I, I/i and I/F\ 

Figure 6: shows the ratio values for 56 Landrace and 33 Large White 
animals. A clearly bimodal distribution is observed with 7 Landrace and 
3 Large White individuals having a ratio value of approximately 3 or 
above, suggesting them to be heterozygous carriers for the f allele 
(genotype I/f\ This means f has gene frequency estimates of 6.25% 
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(7/1 12 chromosomes tested) and 4.5% (3/66 chromosomes tested) within 
the Landrace and Large White breeds respectively; and 

FIGURE 7: shows a plot of Ct FAM versus Ct TET for animals of 
genotypes li and // analysed for KIT splice mutant genotype using 
TaqMan® chemistry. 

Example! 

RT-PCR of porcine KIT exon 16-19 

i. mRNA purification from blood samples 

Fresh blood samples were collected in citrate tubes from coloured Hampshire 
pigs and Large White pigs* Leukocytes were isolated from 5 ml blood using 
Ficoll 100 (Pharmacia Biotech). Isolation of mRNA from leukocytes was then 
carried out using the Quickprep Micro mRNA purification kit (Pharmacia 
Biotech), The mRNA was stored as a precipitate under ethanol at -70°C for up 
to one month before use in reverse transcriptase (RT)-PCR. 

ii. RT-PCR of KIT exon Wl9 

First-strand cDNA synthesis was accomplished using the First-Strand cDNA 
Synthesis kit (Pharmacia! Biotech) so that -100 ng mRNA was randomly 
primed by 0.1 pg pd(N6) in a total volume of 15 jjlI. Two pi of the completed 
first cDNA strand reaction! was then directly used per 12 pi PCR reaction by 
adding 10 pi PCR mix containing 10 pmol each of the mouse/human derived 
primers KIT1F and KIT7R (5'-TCR TAC ATA GAA AGA GAY GTG ACT C 
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and 5'-AGC CTT CCT TGA 



TCA TCT TGT AG, respectively; Moller et aL 
1996, supra), 1.2 ptl 10 x PCB -buffer (10 mM Tris-HCI, pH 8.3, 50 mM KC1) 
and 03 U of AmplIT^ (Pefldn-Elmer) incubated with an equal 

amount Taqstart antibody (Clpntech) at 25 °C for 5 min to achieve a hot start 
5 PCR. The reaction was covered with 20 jil mineral oil and thermocycled in a 
Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 min, 55-48 
°C (touchdown one degree per cycle the first seven cycles and then 48°C in the 
remaining cycles) for 1 mil and 72°C for 1 min. After PCR 2pJ loading dye 
was added to each sample which were then loaded on 4% agarose gel 
1 0 (Nusieve/Seakem 3:1, FMC Bioproducts) and electrophoresed with 1 00V for 80 
min. Products were visualised by ethidium bromide staining and UV- 
illumination. i 



iiL Cloning and sequencing of RT-PGR-products 

15 The RT-PCR products representing KITexon 16-19 were purified by extraction 
from 2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and 
cloned into the pUC18 vector Using the Sureclone ligation kit (Pharmacia 
Biotech). Plasmids were isolated using the QIAFilter plasmid Midi kit 
(QIAGEN). Cloned plasmid inserts were sequenced using dye primer 

20 chemistry. Each cycling reaction was prepared with plasmid template DNA and 
ready reaction mix containing fluorescently labelled Ml 3 forward or reverse 
primer as described in the ABI Prism protocol P/N 402113 (Perkin Elmer). 
Cycling and sample pooling were performed using a Catalyst 800 Molecular 
Biology Workstation (ABI) following the instruments user manual (Document 

25 number 903877, Perkin Elmer). The resulting extension products were purified, 
loaded and analysed using the 377 ABI Prism sequencer as described by the 
instrument protocol P/N 402078 (Perkin Elmer), 
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iv Results and discussion 

A 424 bp fragmenrincluding KI^cDNAexon 16A9 was amplified from all 
pigs. The Hampshire pigs did not show any additional products whereas the 
Large White pigs (eight tested) all showed a 301 bp truncated cDNA fragment 
(Fig 2). Sequence analysis revealed the 424 bp fragment was identical in the 
two breeds whereas the whole exon 17 (123 bp) was missing from the 301 bp 
fragment. Apparent differences between individuals regarding the relative 
amounts of these two products may have been caused either by different 
genotypes containing differing numbers of copies of the KIT gene sequence, 
individual differences in mRNA expression levels or random RT-PCR effects. 

The two upper fragments present in Large white pigs represent heteroduplexes 
between the 301 and 424 bp fragments (Fig. 2). This was shown by an 
experiment where these slow migrating fragments were generated by pooling 
homoduplexes of the 424 and 301 bp which were then heat denatured and 
cooled to 25°C. Moreover, cloning of the lower heteroduplex fraction of a Large 
White pig resulted in clones with insert length corresponding to either of the 
two homoduplexes. 

Example 2 

PCR Amplification and Sequencing of K TT Fxon 17-Intron 17 (5' Splice Site) 
i. PCR to produce DNA Sequencing Template 

A 175 bp region including the boundary between exonl7 and intronl7 of the 
KIT gene was amplified for sequence analysis using forward primer KIT21 (5' - 
GTA TTC ACA GAG AGT TGG CGG C -3') and reverse primer KIT35 (5' - 
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AAA CCT GCA AGG AAA^ATC CTT CAC GG - 3'). PCR was carried out on 
a DNA thermal cycler (Peikin Elmer 9600) in a total volume of 20 yl 



containing 25 ng genomic DNA, 1.0 mM MgCl 2 , 50 mM KC1, 10 mM Tris- 
HC1, pH 8.3, 200 ^iM dNTPsL 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 
pmol of both KIT21 and KIT35 primer. To activate AmpliTaq Gold, initial heat 
denaturation was carried out at ¥?4°C for 10 minutes followed by 32 cycles each 
consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final 
extension lasted for 7 min at 72°C. PCR products were cloned into vector 
pUCl 8 using the SureClone ligation kit (Pharmacia Biotech), 

ii. Preparation of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar 
plasmid midi kit (Genomed) and the resulting DNA diluted to 150 ng/pl 



15 Hi. Sequencing of plasmid DNA 

DNA was sequenced as in example 1 section iii- 



i v. Results 

A portion of the DNA sequence from exon 17 and mtron 17 of the KIT gene 
20 was determined and compared between animals with each of these three alleles. 
Figure 3 shows that the / allele carries a splice site mutation at position 1 of 
intron 17. This G to A base substitution is present in one of the two gene copies 
carried on each chromosome. The base substitution occurs in the invariant GT 
dinucleotide which characterises 5' exon/intron boundaries. Analysis of the f 
25 allele showed the splice site mutation was not present in either the normal 
(KIT1) or duplicated copy of the gene (KIT2). We have found the splice site 



mutation is unique to the / alleles, and therefore makes it possible to distinguish 
the I-KIT2 sequences. 



Example 3 

5 

Testing For the Presence of the Splice Site Mutation wit h PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease MalH (CATG) was used to exploit the point substitution 
10 identified at position 1 of intron 17 (Figure 3). The Malll recognition sites in 
the fragment amplified from KIT and the expected restriction products are 
illustrated in Figure 4A and 4B respectively. 

15 

i. DNA preparation for RFLP Test 

DNA can be prepared from any source of tissue containing cell nuclei, for 
example white blood cells, hair follicles, ear notches and muscle. The 
procedure here relates to blood cell preparations; other tissues can be processed 
20 similarly by directly suspending material in K buffer and then proceeding from 
the same stage of the blood procedure. The method outlined here produces a 
cell lysate containing crude DNA which is suitable for PCR amplification. 
However, any method for preparing purified, or crude, DNA should be equally 
effective. 

25 

Blood was collected in 50mM EDTA pH 8.0 to prevent coagulation. 50^1 of 
blood was dispensed into a small microcentrifuge tube (0.5ml Eppendorf or 
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equivalent). 450ul of TE buffer was added to lyse the red blood cells (haem 
groups inhibit PCR) and the mix vortexed for 2 seconds. The intact white and 
residual red blood cells were then centrifuged forT2 seconds at 13,000 g in a 
microcentrifuge. The supernatant was removed by gentle aspiration using a low 
pressure vacuum pump system. A further 450(il of TE buffer was then added to 
lyse the remaining red blood, cells and the white blood cells collected by 
centrimgation as before. If any redness remained in the pellet, this process was 
repeated until the pellet was white. After removal of the last drop of 
supernatant from the pelleted white blood cells, lOOul of K buffer containing 
proteinase K was added and the mixture incubated at 55 degrees C for 2 hours. 
The mixture was then heated to 95-100 degrees C for 8 minutes and the DNA 
lysates stored at -20°C until needed. 

Reagents 

T.E. Buffer: 1 OmM TRIS-HC1 pH8.0 

ImM EDTA 

K Buffer: 50mM KC1 

10mMTRIS-HClpH8.3 
2.5mM MgC12 
0.5%Tween20 

ii. Restriction En2yme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
position 1 of intron 17, digestion reactions were set up as below: 

3.0 ul PCR amplified DNA 
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1.0 ill 10XNEBuffer4 
0.1 pi BSA 100 ug/ml 

0.1 |ilMaIH10U/ul 

5.8 dH20 

5 

(1 X NEBuffer 4 (New England Biolabs) contains 50 mM potassium acetate, 20 
mM Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following 
incubation at 37°C for 90 minutes each 10 ul reaction volume had 2 ul of 
loading dye added and the mix loaded on a 8% native polyacrylamide gel 
10 (Protogel, 37.5:1 acrylamide:bisacrylamide , National Diagnostics, Atlanta) in 
0.5 X TBE (44.5 mM Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and 
electrophoresed for 3 hours at 200V in a vertical slab unit (SE600 Hoefer 
Scientific Instruments). Products were visualised by ethidium bromide staining. 

15 iii Results 

A PCR RFLP protocol was designed to test for the presence of the splice site 
mutation as the substitution occurs within the recognition site for restriction 
endonuclease Main. Figure 4B illustrates that presence of the G to A base 
substitution at position 1 of KIT nitron 17 results in restriction at each of two 

20 NlaUI recognition sites within the 175 bp DNA fragment. Following 
electrophoresis, this results in fragments of sizes 80 bp, 54 bp and 41 bp. Where 
the splice site mutation is absent however, incubation with NlaUI results in 
digestion only at recognition site 1. Following electrophoresis this results in 
fragments of 134 bp and 41 bp. The invariant Malll recognition site 1 serves as 

25 an internal control to ensure complete digestion has taken place. Results of this 
PCR RFLP analysis are illustrated in Figure 4G. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation 
(lane 2) or carry the normal splice site sequence (lane 3). Lane 4 shows the 
result of analysis where DNA amplified from the genomic DNA of a coloured 
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animal was used. Lane 5 shows the resulting bands where a white animal was 
tested. The test was used to analyse 121 individuals from seven different breeds 
of pig- The splice site mutation was found only in the 97 animals " with lhe~ 
dominant white phenotype (//- or and none of the 24 coloured (f or /) 
5 examples (Table 2). This analysis confirms / and /* to be unique in that they are 
the only alleles to cany the splice site mutation. 

TABLE 2 

Distribution of the Splice Site Mutation Between Different Breeds and Coat 
10 Phenotype 



Breed 


Coat 


Assumed 


Animals 


Normally 


Splice 




Colour 


Genotype 1 


Tested 


spliced KIT 2 


Mutation 2 


Large White 


White 


I/- 


33 


33 


33 


Landrace 


White 


V- 


56 


56 


56 


Hampshire 


Coloured 


i/i 


5 


5 


0 


Duroc 


Coloured 


i/i 


5 


5 


0 


Pietrain 


Coloured 


i/i 


8 


8 


0 


Meishan 


Coloured 


i/i 


5 


5 


0 


Wild Boar 


Coloured 


i/i 


1 


1 


0 


Wild Boar 


White 


I*/- 


8 


8 


8 


x Large 












White 












Totals 














White 


I/- 


89 


89 


89 




White 


I*/- 


8 


8 


8 




Coloured 


i/i 


24 


24 


0 



1 White animals may be homozygous or heterozygous for the / allele 

2 Presence of the splice site mutation determined by NlaYLl PCR RFLP test 
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Example 4 

Quantification Mnrmal KTT and S plice Mu tant KTT f mtron 17 ntl G ~* A ) 

As the splice site mutation is present in only one of the duplicated regions of/ 
and not in the duplicated region off, the various genotypes can be expected to 
have the attributes described in Table 3. 
TABLE 3 



Genotype Copies of Normal Copies of KIT Ratio of normal KIT 

jqX containing the splice to splice mutant KIT 

mutation 



I/I 

I/i 

i/i 

I/I p 

I p /i 



2 
2 
2 
3 
3 



2 
1 
0 
1 
0 



1:1 
2:1 
2:0 
3:1 
3:0 



Due to the dominance of allele /, three of the genotypes in Table 2 are carried 
by white animals and therefore can not be identified by phenotypic 
characterisation. Quantification of the relative amounts of the normal KIT gene 
and the splice mutant KIT gene allows the ratio between the two to be 
calculated, and therefore the genotype of individual animals predicted. This was 
achieved by quantification of two DNA fragments following Nlaia digestion. 
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The amount of 134 bp fragment, representative of the normally spliced KIT 
gene, and of 54 bp fragment, representative of the splice mutant KIT, were 
ini^uredToTlowing electrophoresis using UeneJScan software. 

5 L PCR to Produce DNA for Quantification 

As described in example 2 section i. The reverse primer KIT35 is labelled with 
the ABI fluorescent dye FAM at the 5' end, 

ii Restriction En2yme Digestion 

10 As described in example 2 section iL 

iii Electrophoresis and Quantification of DNA Fragments 

Following digestion, 0,5 jjl! of the reaction volume was mixed with 2.5 pi of 
deionised formamide, 0.5 p.1 of GS350 DNA standard (ABI) and 0,4 jil blue 

15 dextran solution before being heated to 90°C for 2 minutes and rapidly cooled 
on ice. Three jal of this mix was then loaded onto a 377 ABI Prism sequencer 
and the DNA fragments separated on a 6% polyacrylamide gel in 1 X TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The peak area of fragments 
representative to both the normal and splice mutant forms of KIT were 

20 quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice 
the peak area value of the 54 bp fragment (splice mutant KIT) in order to 
25 calculate the ratio value for each sample. 
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15 
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v. Results 

Analysis was performed on animals from the Swedish wild pig/Large White 
intercross pedigree for which genotypes at 1 have been determined TSy" 
conventional breeding experiments with linked markers. Figure 5 and Table 4 
show the ratio of normal to mutant JOT calculated for animals from each of the 
three genotype classes, W (expected ratio 1:1), I/i (expected ratio 2:1) and Vf 
(expected ratio 3:1). The results are entirely consistent with the expected ratio 
values and indicate that the three genotype classes can be distinguished using 
this method. 

TABLE 4 

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a 
Wild Pig/Large White Intercross 



Genotype Phenotype Expected Observed Ratio Number 

Ratio (NormahMutan Tested 
(Normal: t) 
Mutant) 



±SE 



I/I White 1:1 1.15 + 0.075 13 

Vf White 3:1 3.11 ±0.084 12 

I/i White 2:1 2.23 ±0.109 14 



Figure 5 illustrates that the range of ratio values calculated for the two 
genotypes I/I and I/f do not overlap. This enables animals carrying the / allele 
to be identified and the frequency of the allele within different pig breeds 
20 determined. Ratio values were calculated for 56 Landrace and 33 Large White 
animals and the results are shown in Figure 6. A clearly bimodal distribution is 
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observed with 7 Landrace and 3 Large White individuals having a ratio value of 
approximately 3 or above, suggesting them to be heterozygous carriers for the 
~~" ~ ]T all^^ S ene frequency estimates of "6.25% 

(7/112 chromosomes tested) and 4.5% (3/66 chromosomes tested) within the 
5 Landrace and Large White breeds respectively. 

Example 5 

Analysis for presence and quantification of the porc ine KIT splice mutation 
10 using the PE ABI TaqMan chemistry 

Method 

i. Preparation of template DNA for PCR 

15 

DNA was prepared as in example 3, section i 

ii. TaqMan® PCR reactions 

20 TaqMan® PCR reactions were set up as shown in table 5 

TABLE 5 

P CR mix for TaqMan® based splice mutation test 

Reagent Final Cone 11 Volume 

1 Ox TaqMan® Buffer A (Perkin Elmer) 
25mM MgCl 2 Sol a 
DATP 
DCTP 
DGTP 
DUTP 

Amplitaq Gold™ (5U/ul) (Perkin Elmer) 
AmpErase™N-Glycosylase (lU/ul) (Perkin 
Elmer) 

KITTM -NEST-F (5uM) 
KITTM-NEST-R (5|iM) 
KITTM FAM (5uM) 
KITTM TET (5uM) 



lx 


2.50 ul 


5mM 


5.00^1 


200uM 


0.50 jil 


200uM 


0.50 ul 


200uM 


0.50 ul 


200uM 


0.50 \il 


0.05U/ul 


0.25 ul 


O.OlU/u.1 


0.25 ul 


500nM 


2.50 ul 


500nM 


2.50 ul 


lOOnM 


0.50 ul 


lOOnM 


0.50 ul 
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25% Glycerol 8 ' 00 » l 

Porcine genomic DNA LOOM 

25.00 

• . ' tiL_ 

The PCR primers used were as described below: 

KITTM-Nesjt-F (5'-CTC CTT ACT CAT GGT CGA ATC ACA-3') and 

5 1 

KITTM -Nei-R (5'-CGG CTA AAA TGC ATG GTA TGG-3'). 

The TaqMan' 8 probes used were: 

10 KITTM-A FAM (5'-TCA AAG GAA ACA TGA GTA CCC ACG CTC- 
3') and T 

KITTM -G TBT (5'- TCA AAG GAA ACG TGA GTA CCC ACG C -3') 

1 5 The TaqMan® probes were prepared by Perkin Elmer and labelled with FAM 
and TET as indicated as well as the standard quenching group TAMRA. The 
lOx TaqMan® Buffer A, Amplitaq Gold™, AmpErase N-Glycosylase, NTP's 
and 25mM MgCl 2 used were part of the TaqMan® PCR Core reagent Kit, 
supplied by Perkin-Elmer. 

20 

The reactions were then placed into a Perkin Elmer ABI Prism 7700 Sequence 
Detector and the reaction carried out using the following thermal profile, 50°C 
for 2 minutes, 95°C for 10 minutes followed by 40 cycles of 95°C 15s, 62°C 
60s. The reactions were carried out under the control of 'Sequence Detector 
25 V.1.6' software using the 'Single Reporter' and Real-Time' options with the 
'Spectral Compensation' function activated. Upon completion of the run real- 
time profiles for each sample were examined on the ABI7700 to check for any 
samples giving highly irregular profiles which were then excluded. The 
thresholds for both dyes, Fam and Tet, were set so that they intercepted each 
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dye during the exponential phase of PCR. Following updating of the 
calculations in 'Sequence Detector V.1.6' software results were exportated into 
MS Excelfdr ftirthef analysis. 

5 iii. Analysis of results 

Based upon the underlying theoretical principle that one cycle of PCR gives a 
doubling in the amount of cleavage of the quenching dye from the allele 
specific probe and therefore doubles the signal one would expect the threshold 
10 cycle numbers from the II and Ii genotypes analysed to be as below: 

Table 6: 

Theoretical results for TaqMan® analysis of genotype at the KIT splice 
IS mutation 



Genotype 


Copies KIT 1 


Copies KIT 2 


Theoretical Ct 


Theoretical Ct 




(G) 


(A) 


TET(G) . 


FAM (A) 


n 


2 


2 


X 


Y 


n 


2 


1 


X 


Y+l 



In theory the Ct for TET and FAM signals, represented as X and Y should be 
the same, as equal numbers of copies of the target sequences should be present 
in an JI animal. However in practice this does not necessarily occur due to 
20 differences in the hybridization and cleavage efficiency of the two probes and 
variation in the setting of the threshold cycle between the two dye signals. The 
reduction in splice mutant containing (A) sequences relative to those not 
containing the splice mutation (G) in the Ii animals ie 2:1 G:A ratio rather than 
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1:1 as for // genotype, should lead to the FAM signal reaching the threshold 1 
cycle later than the TET signal in the genotype // animals. The actual results for 
samples tested are shown in Table 7. 



Table 7 



Ct values from analysis of EL and li genotypes 


Sample 


Genotype 


Ct FAM (A) 


CtTET 
(G) 


Ct FAM - Ct TET 


1 


Ii 


24.68 


22.59 


2.09 


2 


Ii 


25.98 


23.62 


2.36 


3 


Ii 


26.54 


25.57 


0.97 


4 


Ii 


27.37 


24.78 


2.59 


5 


Ii 


24.94 


21.61 


3.33 


6 


Ii 


25.68 


22.1 


3.58 
Ii Mean = 2.49 


7 


U 


22.05 


23.78 


-1.73 


8 


II 


24.22 


24.59 


-0.37 


9 


II 


24.19 


23.85 


0.34 


10 


II 


23.66 


23.51 


0.15 


11 


II 


24.35 


22.71 


1.64 


12 


II 


22.82 


21.69 


1.13 


13 


II 


22.84 


22.7 


0.14 


14 


II 


23.17 


22.9 


0.27 
Mean = 0.20 



# 



32 



No 35 35 0 

Template 

No 35 35 0 

Template 

No 35 35 0 

Template 

No 35 35 0 

Template 



Despite variation around the mean values it can be seen from Table 7 that there 
is a significantly increased delay in the FAM signal reaching the threshold level 
(approximately 2 cycles) relative to the TET signal in Ii animals compared to II 
5 animals as predicted, reflecting the reduced number of copies of the splice 
mutant (A) sequence present in animals of the Ii genotype. Plotting of the 
individual samples on a scatter plot (Figure 7) shows clustering of the two 
genotypes with the Ii cluster shifted along the Ct FAM axis due to the reduced 
number of copies of the KIT2 (A) sequence for which the FAM probe is 
10 specific. 
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